PGT/KR 2004/6 0 0 7 52 




This is to certify that the following application annexed hereto 
is a true copy from the records of the Korean Intellectual 
Property Office. 



g §j yj S 
Application Number 

§ g ^ | | 
Date of Application 



10-2003-0020023 



2003^ 03M 31 SJ 
MAR 31, 2003 



PRIORITY DOCUMENT 

SUBMITTED OR TRANSMITTED IN 
COMPLIANCE WITH 
RULE 17.1(a) OR (b) 



si 

Applicant(s) 





■ ■■ < 

• n 9 ;f 



— 7 v;. . 



11 ' . • ' f, . ■ ■ » t 



-V 



ALGENTECH , et a I . 



2004 



*1 



04 



2S 



29 



COMMISSIONER 





Si 


:.f51Q3Sa!( 






0023 



^ 2004/5/7 



[AH US] 
[4^*11 

[§§!21] 
[SSI 

[f?!2!3£] 

13 91 
[£JAH!^] 



] 
) 
] 



mm 

mmm® 

2003.03.31 

Genes involved in the biosynthesis of carotenoids 



S9EH 

4-1998-039808-4 



OIXHij 

4-2003-011731-1 



1-2001-008962-2 



9-1998-000385-9 
2003-019988-1 
2003-019853-9 
2003-004687-1 



2J9EH 

4-1998-039808-4 



OIXHS 

4-2003-011731-1 

&^oi4Bga^Ma (kccm) 

KCCM- 10460 
2003.01.24 



46-1 




0023 



53 Ol 



X\: 2004/5/7 



[£SAHJ?S] 



^1 

15 



XH 425 21 Oil 21 » SS, XII 605 2| ff§0« 2| 



^51 ^ 
0I«S| (SI) 



20 
33 
0 
16 



29,000 S 

33,000 a 

0 S 
621,000 @ 



683,000 9 

5^1 a (70%ae) 

204,900 a 

1. fl2f/H- SA||AH(Ee)_1 



2.0I^M^I^e9AH_1S 3.5^ISS 



46-2 




lOal K0023 



% 2004/5/7 



1± ^>S.E]}^olH. ^tH^l ^^-fe- -B-^Hl ^ ^ °-S.*\ , iL^f 

^>S.B)ico]^ ^^Hr ^^Jl 2, A^^Ji 4, ^^Jl 6, ^1 

^31 8, Al^^iS 10 ^ ^i^^is 125. 7l7fl£|^ -B-^V* S^Hfe ^m^ico]c -n-^. 

» 



£ 8 
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7>S.B^i^o]H.^] ^£ L ^l"^-^ 6 !! ^r^^l-c- -Tr^KGenes involved in the biosynthesis of 



carotenoids} 



£. l^r 7>S.B|ic.ol^^ S-Al^- 

£ 2fe ^tflAl^S^M ^-El^ DNA €-*fl*r^ ORFCopen 

reading frame)^ <>Mi:t> <*^4 ^e^Ml^ 4* ;a ^ s^ti] -g- <tp 

p-^>S.€ ^l]S.5ll ol^(p -carotene ketolase; crtW)£] oMk^ «]SLt!: ^3*M 

P. haeundaesis: ^e^-s^i tfl^tflA]^ 
Alcaligenes.sp: ^el^Nl^ ^ 
Bradyrhizobiunusp: Jl^M 212^1 -g- ^ 

Consensus: U*^]-^ Or^li^ ^ 

£ 3£r 4^^^ ^fl^tflAl^^^l ^ej^ DNA ^fls'Hr- -T- ORF^ o^v^ 

a> A^if <£z};B};*f|v|}>i ^ ol^-§-3.^-iH p-^>S.^i *>o]^-s.^eflc>l3(p -carotene 
hydroxylase; crtZ)^ o>p1^a> v]ZLi& &5fyo]t\. m 

P. haeundaesis: 4^-3^ Sfl^tflA]^ 

Alcaligenes_sp: 
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Consensus: ^J*|3|-fe ^lic^V a] 



S. 4^ 4^71 i ^^tflA]^^«-&| -g-^^ DNA 4| ORF^ 

a> a^jz)- s-ej-^^-B)] e) -g- ^ u^-g-^JjL^ ^-ej^ ^S^l ^>°l#^l°l^(licopene cyclase; 

crtY)^| ^14- ^^Kl 1 }. 

P. haeundaesis: *fl£tflAl^ 
Flavobacteriunusp: 3 -g- # 

Consensus: o>nllc& -H*! 

£ 5^ ^^cflAl^^E^ ^Ej^ DNA ^ffl €r*i|^Hr vfl t£*fl ORF^ 

av a-^iz}. e] -g- ^ pI^^-S-W-b^ ^-ej^ s]M<ffl t\ *\}^-$] o]2:(phytoene desaturase; 

crtl)^ ^liv*]- ^\^-%: w]iiLt!r ^4°!^. 

P. haeundaesis: ^^cflA]^ 

Flavobacterium_sp: ^^Ji 1 ^ el 

Consensus: 11*1 o>f licit *\<g. 

S. 6£r sfl^tflAliS.-^-^ ^-el^ DNA ^^Vfe- 4^ ^*fl ORF^ a>*l 

lc<+ a^3|- s.eV^.^flej^- ^ t^^S.^ ^H^Cphytoene synthase; 



CrtB)^ <>Hlc^V A^^- *)5Li& ^3±o]t\-. 

P. haeundaesis: ^3.9]^ Sfl^rtflA]^ 
Flavobacteriunusp: ^JOHM ^1 4p 
Consensus: ^^1*>^- ^lk^ -M^ 
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10^^E0023 *3 ^^V: 2004/5/7 

S. 7-& 4e}s^i tfl^cflAli^Bl ^-e^ DNA ^^Ml ^fl^Hr <*l^ ^ ORF^ o\u] 

(geranylgeranyl pyrophosphate synthase; crtE)^1 *|<i^§- v]5L$: ^t^l^r. 
P. haeundaesis: ^3.7]^ *fl£-3M^ 
Flavobacterium_sp: 4=- 
Consensus: oMicft *\<£. 

S. 8^r sj-ej-s^i ^-grtflAl^S-^-Bl ^-5]^ crt -rr^AQ ^(organization)-!: 3EA]sj- 

S. 9^ pCR-XL-TOPO-crtfull 7fl^^]iE» i~hW 

#•3} si ^^-S. 190-890 nm:A ^ 350-550 nm :B)-& i7lN3r ^4* 450 

nmSl 3t)Hfe JL-fi- s)aL°lJL, 470 nmS} Or^Bj-^S} jl-fi- si) 3. o] t}- . 

[*^^ ^] 

<30> . ?|-^B|2nOl^ ^tMH ^r^^ -fr^HI ^ -9.3.^ , ^-^f ^M*>7fl^ 

7Vsb|^o1= ^^o)] ^Att ^t^j: 2, ^<gwis: 4, 6, ^t^s 8, ^It^s: 10 
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gin 

<3i> 7>S.B|t^^(carotenoids)^ 7>^lZL C 40 °1^5ll^ 0 l^- 

(isopenoid compounds)^, ^ ^-i^M Sife tb ^3 ^ 

3.^- £-?>3.€(p -carotene; ^i), e| a^fl ( 1 i copene ; £^£^1 ^ 

^fiS#€(fucoxanthin; sfl^Mr^ %^ SE^r ^ ^i) ^4. ^SB^l^ oisfl tfl 

oflA-i H)Ef^i a^I sIt^s*^ qit% «H. ^ %>x|jL3)-sq- -fr-sA^ i7i*f-g-, ^ 
q e^l^-g- gj ^<9- ^ >sfl sj-g-o] ^ ^Sl^ f-^r <W«H=- A 

3'-dihydroxy-p , p-carotene-4, 4 , -dione)^r SE^- iL^]^ 

<33> [J^*H 1] 

0 

OH 

HO || 
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<34> o>^Bj-^^r ^3, xfl-f , S^l^Kred seabream), <$.°\ (salmon) ^ v}^}^ 

(lobsters) ^ n<$ ^ €MI*rfe ^(Fujita a/., Nippon 

Suisan Gakkaishi., 49: 1855-1869, 1983; Johnson, E. A., Crit. Rev. Biotechnol., 11: 
297-326, 1991; Nelis ef a/., /. 4flp/- Bacteriol., 70: 181-191, 1991). o>i^^^- ^ 

^15.^ 2r:5h ^ ^«3-t- -fr^Rr ^>-§-^r ^ ^ ^ ^W^, -n-^ ^(hydroxy or 

peroxy radicals)^) -fl-^-i- ^l^r^- ^-S-ir ^-(Palozza etaJ., Arch. Biochem. Biophys., 
297: 291-295, 1992; Shimidzu ef a/., F/sA 5c/., 62: 134-137, 1996). StL 

^% #^( immune modulatory activity) % 2^ ;M-(cardioprotective effect)* 

3Hr ^B^tKJyonuchi ef a/., Nutr. Cancer., 19: 269-280, 1993). -^1 , °\^^r 

Q<>) *j-A>^-^^- ?>S.B)^ol^ o) 10H fl o^olji, q -S^5fl#( tocopherol)^ 100*11 
Si 3} 0.3. <£e};3 &th 2.^, Qt$77\7] o\2^QQs>\ (toxicity)^ tfl*||X^ ILJi^ 

i 

$rf. icJ-eJ-a^ oy^B)-^!:^^: ^1^^$ (neurodegenerative diseases), "^(cancer), ^^€^r 
(immune disorders), 43 ^"t^^^K cardiovascular diseases) XtN-fe *1 

* 

3. ^ ofl«<H *3Ua. atr °H tfltt "£^7} 31^ s]3l ^i=r(Beal, H. F. , Jjfe 

i 

Neuroscientist, 3 : 21-27, 1991; Chew et al. , Ant i cancer Res . , 19: 1849-1853, 1999; 
Murillo E., Arch. Latinoam. Nutr., 42: 409-413, 1992). S=3h oV^^r^ ^ 
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^ 2004/5/7 

» 



<35> ' ^e^] s^^r-sl-S-^KF. Hoffman-LaRoche) 6>^^^ ^ 

■i: ^l^S-I-a- 6 >^^:€^1 *H^i| -#"33 *43°1 ^ Si^- °>^^r 

^-i: ^-£r*rfe- ^l^-i-^^- £.3-91 s)-5r)6> S5.^1f>(/%a///a rhodozyma', Miller 

ef a/., /of. /. Sysr. BacterioL, 48 : 529-536, 1976), ^2:^ ^IWSS^ 
^{Haematococcus pluvialis; Bubrick, Bioresour Technol., 38: 237-239, 1991), ^L^-^S 

10Z(Brevi bacterium 103; Lizuka & Nishimura, /. 6en. AppI . Microbiol., 
15: 127-134, 1969), n^t—- °>H.S.iii-i3]e|-g- g^^bIo}^ (Agrobacterium aurantiacum', 
Yokoyama a7., £/<?sc/\ Biotechnol. Biochem. , 58: 1842-1844, 1994), sj-e}-^^ ^V^^a] 
°\°Waracoccus marcusii ; Harker a/., /. Syst. BacterioL, 48 : 543-548, 1998) 

^ s^-S^^ ^^^^^^^^^(/araccccus carotinifaciens; Tsubokura et al., Int. J. 
Syst. BacterioL, 49 : 277-282, 1999) ^1 9X^. 



<36> 2)e 6M 7>S.B)^o]c Aj^-Ajofl ^S}^- Sl^S"^- S^}^ tt^*}*^ tfl 

^Sl&.o^, 7 ]^o] fl-^£| ^(Armstrong, G. A., /. BacterioL, 176 : 4795-4802, 

1994; Sandmann, G. , Eur. J. Biochem., 223: 7-24, 1994; Wieland, B., /. BacterioL, 176: 
7719-7726, 1994). ^S.B)^o]^. ^S.^- efl v^o] = ( i SO prenoid) ^S.^ 

^-S-tF ^ ^^1-^1 FPPCfarnesyl pyrophosphate)^.^ sj-^c}-. s. 1* ^Sfj}^, FPP 
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£}- IPPCisopentenyl pyrophosphate)^ crtE^ £}*fl ^l^^^ ^S.^— 

<OB : |]ol^(geranylgeranyl pyrophosphate synthase)^! GGPPCgeranylgeranyl 
pyrophosphate)^. €*=h <W, GGPPfe crtB^ ^ Qsl^tt 3\^<& 413H^(phytoene 
synthase), crtW ^\^\ tyS-Q^l^: ^lTf-^H^phytoene desaturase), ^lz]5L crtY 

^ ^JlsJ-sI^ *M#3H^(lycopene cyclase) 6 !! ^«fl ^r-s-^- 

5.^3. ^^r^^-. p-^VS^l^r crrJN #3:3^ p-^r-S-^l -carotene 

ketolase)^ crtM ^ ^JlSrSl ^ p-7>S.^i £r°l— 3H^(£ -carotene hydroxylase)^ 

tiV-g-^: 7^ o^b^^I 0.3. 
<37> 7>S.B)ic.o]^ <g$M§efl ^c^^>^ err -B-^>(carotenogenic gene)^ ^Hl, =?- 

^(organization) ^ ^-S 6 1 ^ ^ (Rhodobacter 

capsulatus) (Armstrong a/., £en. Genet., 216 : 254-268, 1989)5+ o|H^t)d} isflsti] 

^^{Erwinia AerZ>/a?/a)(Sandimann er a/., FEMS Microbiol . Lett., 71: 77-82, 1990; Hundle 
e£ a/., Photochem. Photobiol., 54: 89-93, 1991) ^ <^]s.^M^> ■ £ h^£.£-Q(Fnrinia 
uredovor a) (Mi sawa e£ a/., /. Barter iol., 172 : 6704-6712, 1990) «> Si 1 }. 

♦ 

5Eth cr^ f cr^Z, cr£7, crt W ^ c/^ZS. ^-f-^^ 7r5.El^o"|c cr/ 1 -fr*l*r7> 3f)<£*l 

aJI-o] o>n5.^-Bll e] -g- o]~f ^b|o>^ o.s.^-b! &°.i*|(Norihiko e£ a/., /. 

Barter iol., 177(22): 6575-6584, 1995), GGPPS.^ £-7}3.3l7r*l^ !£-§--§- Jl^ 

37flsl ^A^crtB, crtP^ crtY)A S.£^1^H tfi*fl 
iL:2^1 ^(WO 97/23633) . 
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-fr^^Hrfr -gr^l^M Jfl^fl tItH}^ ^, crf£, cr^tf, cr£7, cr^7, 

crtW% crtZ -rr&x}, =LZ)il S.^}^ art #3.^«H ^ ^Hl* fl- 

<39> -g- l^S} ^-^^r ^VS^icO]^ ^^^1 ^.^-^ 3.^*}^ -^-^7>* Xl^}^- 

<41> uj-47}, t^S] £ T}-& -E-^ ^71 -R-^7}# o] -§-^M 7>5.B|^ 0 1^» *o V 



^*>fe- aII^Jl 2, ^i^^^: 4, ^il^s. 6, ^il^is. 8, ^i^s. 10 ^ ^t^Jc 125. 

<43> St!-, It^^r -^-7] -^-^7># 3Lf-*>^- ^>S.B)^o]^. ^-^^># ^1^^. "P" 

^ ^^^r A^alJ: IS. 7]7fl£|^ <§7H<i,g- £±r crt ^-^7\% A^r%^\. 

* 
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<46> o^, JfrXfttf 

<47> -S- ^>S.E)^o]c ^^ofl ^.fi.^ 5I^Kr 2, ^^Jl 4, 

Ait^Ji 6, AUtSlJL 8, a^^s 10 * ^^3: 125. ^l^^l ^"H^ 
<48> #7] -B-^^ ^e^b^ ^^>§>^ 3i|-a)-s^^ «fl^r3Hi(7l^«lJi: KCCM-10460) S^- 
<49> ^tgo) 7>s.e1^c>]h. ^A^: 3.^^Hb -fr^Wfe A|<^tf|J: 2, 

^s. 4, ^II^J: 6, 8, a^^ijl 10 ^ ^^j: 125. ^sq^ 

-B-^>^1 «>^^. -^71 a^i^ 2, A^^is 4, ai^je. 6, *)<ii£:3L 8, ^i^^is 

10 ^ 12S. 7l7fl5l^ A-^^ 3, 5, 7, ^ <gwj 

.J: 9, ^<g^j: ii * A^^i^: 13-^.s. 71^^^- o}*lic^i- ^-Jisl-^V^ a^o-|c|-. m. 

■ 

67fl^ -B-^^> ^ olS-W-E] 53.^ ^^^<^] tfl^M ^>7l a 1^1 7)^}^. 

<50> 



« i 
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[5. 1] 









— 


a-II^ 2 


crtl/f 


(JP -carotene ketolase) 


Ail^is 3 


a) £312- 4 


crtZ 


(P -carotene hydroxylase) 


AllStJL 5 


A^iL 6 crfK 


5}.25§ ^M-l-islMSUicopene cyclase) | A^sia: 7 


Ail^SL 8 


crti 


(phytoene desaturase) 


Ai^^JS 9 


Alibis: 10 


crtB 


3|S. < ai -tl^||o|2:(phytoene synthase) 


*\<g.?13- 11 


Ait^ 12 


crtE 


(geranylgeranyl pyrophosphate synthase) 


Aj<asiSL 13 



ATCC96815)<*fl S^sH °r^#^ A^AV^cfl A>-g-^ ^ 5&th 

7>5.B)^o")=. a^aj -R-^}(carotenoid synthesis gene, °1*> 'erf -fr€ 

* 

TsV^ ^M)^)^ FPP(farnensyl pyrophosphate)^.^ o^e} ^^Vfe- 3M*rfe 
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n^;tol;EL -B-^V^-ir S-f-W. £ crt -fr^W^ ^ 8°fl S. 

^1^4. £ 8*1H Jife ■& ^3 erf -fr^Hr 6,223 bp^l 3.71 oH, zz. ifl^ 

ofl^ 5'->3' crtF, crtZ, crtY, crt I, crtB ^ cr^ -B"^>7> ^tfls. $1 

x\. at!:, H vfl-Mfe , Smal , Xma\ , Clal , HinSSL ^ BanE I ^l^^i'M^l 

&tj-. cr^r, crfZ, erf 7, erf/ ^ crf£ -fr^Vi-^ ^^€-(stop codon)^ =l ^ 
-n-^7>^ A]2)-^&(start codonH (overlap)sH , crf£ -fr*L*|-fe- #iL3 

7>^( complementary strand)^ ^^-^-S. ^flt^. 

<54> 5E«h #7] 7>S.B) h^o] e ^7>» A^lt ^l^tt^. 

<55> ^flS^- ^B)^- -fi-^7>» 7l£: AJ-^^M ^l^^r 

oflAi A>-g-^ ^ ol^ 7l^r ^jtjfe -^-^7>^ ^^S. 

tfl^-fr A>-g-*|-$r ^-f^fe tfl#^ 4^l7l€^ 7MJ1 5fe tfl^-g- A>^>^ ^O] 

a>^^-Ji, JLS.» A>-§-§>^- ^-M^ 512.^ 4^71^^ 7HH 5fe 3:2.-8- ^« ^>-§-*> 

7A°) S^l. tfl^sj- Jl2.^ ^7l€^r 1" ^ 7V*|aI $X±± ^» ^ 

(shuttle vector)» A}-g-^- ^5= wH^tt <&*\<*<**\*= pCR-XL-T0P0 ^» 

a>-8-*M crt -fr^r* S^fe X|a*|-Sft^. °1» 'PCR-XL-TOPO crtfuH'°l 
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<57> # 7 H *i ^^flXfe- tfl^-g- JL2.» a>-§-^ ^ 31^, tfl#5£ XLl-Blue, TOPO, 
BL2KDE3) 3.^ l-&|i(codon plus), DH1 ^ DH5 a ^€ -^^s)^ 

S, 7l^|5)fe crt -fr3l*>«- 3.^-*}^ ^flS^- ^B] pCR-XL-TOPO crtfull^- tfttQ BL2KDE3) 

3^ M^afl -g-^« *U2^>£4. 



<59> 






<60> 


1) A)<i^jL 1^ cr^ ^TT 




<61> 


2) #7j] 1^ -fj-^*}* ^S^" ^ 




<62> 


3) #7fl 2^ ^flS^- ^E^l- ^Afl^ofl ig 


#31; 


<63> 







^.oj) AV-g-sl^ 3Me^ A>-g-m *r XLl-Blue, TOPO, BL2KDE3) S^- #&|^ 

(codon plus), DH1 ^ DH5 a S- 5.2.3.43 A>-g-^ ^o] y^^c).. 



46-15 



10^^^0023 °^}- 2004/5/7 

£ w>^tb ^li^r BL2KDE3) #B^* ^}-%-^}^. SE^b, *8" 

<65> #7H*| Bfloj:^. ^ 1 tfl*] #7fl 3^- ^ 5^* ^3-7>W *fl*l°1M 1*> 

-fK£»K inducer), ^l^M IPTG(isopropyl-beta-D-thiogalactopyranside)* ^7>S. ^7>^- ^ 
31 ?>.3.£| i^l JE. 7l^, ^l^tfl FPPCfarnesyl pyrophosphate), GGPP(geranylgeranyl 
diphosphate) 3=fe GPPCger any 1 pyrophosphate)* ^7>S. ^7>^- ^ Sl^r- #7l *fl<£^.£. 

Bji^ol^^ sl^r^- HPLC(high performance liquid chromatography) TLC(thin-layer 
chromatography)* °l-§-*H 1}"lH 4^ *r 



<66> ^ *\^^% ^H^IHfe ^"71 i&Tfl-i- ^ err -R-^r* 5L*r*r^ 5" 

^3.^ ^A><g ^4^^ ^Jtb HO #g/g(dry weight)* ^84r*M <>r^43r 

€ -*3^^r<?i sf^s^ii 3fl£tflA]i7> ^^V^ o^^QQSi] <£(25 #g/g(dry weight)) ^.4 

r 

<$*r«r*| *HMHS. ?}3.*)±o)B_ A^^-Aj -B-^7>* o]-§-^><^ tfl^o_3. o}^4^;Ei* $ 
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10^^0023 *^ 2004/5/7 

<67> £ ^ ^HH^I^ >3^}ofl :g_£_tr ^T^^^T -fr^*}* 

^ #S^*>7] £|^H sj-^S^^ «fl^cflAl>iS.^-Bl 7fli^q DNA bH ^-^^ (genomic DNA 

library)* ^r^rS^. Tfli^ DNA e^m^^r ^3H7fl 1HH t 1 

^^j. ^ &o_t3} ( t^H-^-S. ^i^lMfe- S^lH. ^je](cosmid vector)* °l-§-^H ^ll^^l 

DNA eMJie^S- T^^Sft 1 ^.. 
<68> cf^- -aX|«sfl«f||^fe *^ -tfJisKcolor complementation)' °l-§-"5r°} >fl 

DNA SMJiB^SJfEj 7>^£) ;to] n ^th^ -fr^UW* #S-^^>^4. 

7>3.b| icol = » a|aV^- ^ ^ #7fl ^rf. ^tfl, cr f£ crf£, crtl ^ 

t&, crf£, crftf, erf/, crr/T^ crtZ ^A7\ 3L^€ tfl^^ o>^4«* 

7fl^ DNA &r^l«.B]sl» 7>S.Bjic:o]jc ^Jg-^J 7]^«1 FPPCfarnesyl pyrophosphate )♦ ^7} 

^, 4 #3.^)3.^ ai^ls. ^)B]1- ^-^^H ^ i-H^^l DNA ^( insert)^ <*7l 

*)«g-g. ^58*>5atf. ZL ^3}-, 3.7) si\ DNA 6,223 bp^l 3.7}^ ^-SrS^. 

#7] DNA <g7H<£^ ^^3: IS. 7)*m%r%. 

<69> ighgS} 2E tf^- ^*HHH^ ^S.ElitO]c^. A^A]-^}^. -^^7>}7"r ^7fl*r^ ^ 

^S]^ 6,223bp^l DNA «g7H£-fr *4J«H 67fl^ 0RF« IH^^t. °l*^r NCBI 

GenBank iHH ^tr ^!*r, 4 -fr^)^ ^ FPPS.^ o>^b}^^^- 
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102^^0023 *^ ^*> : 2004/5/7 

£ 2 ifl*] £ 7 ^-S). o) ^g.if-&\ ^ ^T^s-^ ^ DNA 7>5.B] 

a). 

pCR-XL-TOPO-crtfull-i- ^1^>^^(£ 9 . #7] A^ ^» tftt^ 5L^H 

oi«>7l #7] H^**!!* nfl^H 5-^11* 3s ^-M: $7>«H 4*C«1H *V 

^•U- ^^S ^-^^^r -t^^M 190-900nm^ <H ^^4. ^ ^ 

oil-, £ ioofl ^-o] , p-^ys.^ ^^^o] jl-b- 3i) a« sfrsitl: a. 

^ ^7] #^^) HPLC ^r*8*|-Sft^-. °1 S.^ 

A]n*}(SigmaHH 6>^^^- A>^5|tJ.. zl ^-of, ^ 

ofl trj-e}- ^e)^ crt %-&7}7} H^^^} p-9>S.€^ <>}^&ig.-%r ^-SH"* 

^ 9X9X^. €■ ^^^^H^i 0}^^^^^ HO jug/g(dry weight 

rf . o]^ o^B]-^!:^^- ^J^}^ -g-^rSl ^^3.9]^ tft-&t%X\^7} ol^T^QQ-lk 25 #g/g(dry 
weight) ^S. ^ wlH^HH crt -fr^T.}* tfl^ofl % 



<71> 
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<?4> «fi^rtflA]i(KCCM-10460)^: PPES-II «W(B^^ 1 g/« , 1 

g /i; sQ^ AlH^IMB 0.01 g/& , 2 g/< §J 3 g/Z )<M 25T?<M 10H 

# Hfl^S^. 6 1^, *fl 6 <J=^-i: 13,000 rpm^IH ^15L» ^^51^. 4 

S^jjLEl 7fli^q DNA» ^:^«>7] STE ^^-§-^(10 mM Tris, 1 mM EDTA, 100 mM NaCl, pH 

8.0HH ^, 6810^1^ 15€-# St-i-^m^. €^-gr3*H ^, 

-§-<^ 1(50 mM tf3i, 25 mM Tris, 10 mM EDTA, pH 8. OH] ^H^^l^^h 5 mg/mi 5}°l^*r 
^(lysozyme)^ 100 ng/mi RNase A» ^7>-3r>JL ^91 SyC^l^i #-£*Mi=r. 

uH, cj-tifl^ ^isflj:^ (Proteinase) K* 250 j«gM ^7Hr ^ , 37TC°1H 3*1# t=t 

1^1^4. N-eb-fS-^to^KN-lauroylsarcosine)^: ^^MN l%7 r ^31, 37°C^1 

a| ^}-g-A]^c]-. o]^ > 2H^-#S.Si# (phenol-chloroform extraction) 0 *! Tfl^ 

DNA# ^^-534. ^ ^#^r 3 2wfl 100% *\]^ 

^ Tfl^ DNA* 31*1*1 5Q ^, 70% ofl^S. Afla^S^. i-H, TE 3r^-§- ^ -i: 

<75> <^A]cfl 2> Tfii^l DNA eMiL^ (Genomic DNA library) 
<76> <2-l> 3.^]^. ^eKcosmid vector) §«1 
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<77> S^E. ^jEl.(SuperCos 1 Cosraid Vector, Stratagene) 25 jug^l Xba 1(9 U/jUg) ^ltts^i 

# ^7>^-JL, ^-§-^<^l 200 ^«7> 5}7H «H 37°C°1H 1A1^> ^>-§-Al^ ^W^^. °1 

^, s|)^-#S.S.3£# DNA» ^re)^r ^, 100% «*fl^-§-3. 4131*1 ^ a I 

^ ^#^(dephosphorylation)*]7]7l 3}*fl CIAP 5L^(Pr omega)* ^7}^ 
37T:<>1H 30^: ^-<& ^r-l-^m^-. °}^, t%X| ^lt-tS£if ti|£|.g. ^-ft ^, 

100% ^SUl^. ^-7] ^5}^ ^Bj-i- cfA] BanR 1(5 U/jKg)°-3. 37 *C ^-i 1*1# * 

-S-«flA]^ l ng/nl7\ f;}^. 



<78><2-2> 711 ^ DNA 51-olti.alsl 

<79> #7l -^Al<4] jxf-e)-S7l^ fsfl^tflX)^ Tiling DNA 100 pg<% Sau3A 1(10 U)^ 

*|e)-aH Jli «H-Al^tK w>-§- ^ ^, 0.5 M EDTA* ^7>-gr}^x^-. o^, sfl^ 

-fSSIf 7fl^ DNA* $B]*V ^, 100% ^M^rS. 43 ^l^. ^f^r S-i 

€ Tflic^l DNA» TE ^-g-^Ml ^ *1 7] JL , ^7] 12}-^-^^; CIAP 1L±.% *\ 

■ 

Bl§H % SHtMSat}. 3)1 ^-l-s.s.5E-S- DNA» ^reltb 3s -#7] -M^Hl < 

2-1>^a-1 ^wl^ i2i*l:E. ^jBlfil- 5l-ol7llol^(iigation)Al7l7l ^-gfl, T4 ^lTlH^Uigase, 

i 

Promega), 10 >eJ-ol7flol^ ^-g-^(Promega)4 #3. 12^«>ll^i 18*12: ^-s^l^. ^>-§- 
^ ^-71 Bj-olTflol^ ^^-^(liation mixture)^- tfl^-^-^l XLl-Blue(Stratagene)^! ^€S^^> 
<*| 7flii^ DNA a)-o]ti.siejl- ^^>^4. 
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<80> <^aH] 3> -B-^>» ig-fi-SHr -3-^ M ^ 

<8i> LB agr Hfl*H i- o] ^ ^^-^ 7l^[ ^ FPP(Sigma)* 1%7> 

^7>^ ^, ^7] ^aH] 2^1^ 7\}^ ^<>l«.3lBl* #7] ^HH^l S-^SH 37°C<*fl 

T3fl<£€r$|t}-. ufl<#^ itffl^H-g: =1^ 137fl^| 2000 

7fl ^ 137fl). -3-71 137flSl #S.V|S.^-Bj 3^*1 H. ^Bll- ^r^^t^-. n^o] 

"1 £1^ *1€^ (primer working sequencing)* ^*§*H A 3^*1=. ^^H DNA ^€ 

51 <*7H<^§- ^^§>^cf. cl nfl ^71^3 ^-Br (^)^l^ofl SlSj^ ^^t}. 
<82> ^2)-, H. ^1^1 ofl ^-^^ DNA ^ ^^1^ 7}^ sj-^r 3.71 ^l DNA 6,223 

bpolSU^, 13- 7l7fl£l^ <37H14- °H ^71 ^^21 13. 7]*i)3^r <3 

7H<i* ^ S*}-5rJZ ^}b1» ' C0SCRT ' &h ^^rS4. 

<83><^aH1 4> 7>S.El^ol= a^aJ ^-^ -fr^r* DNA ^^1 &q 

<84> Aj-7] ^aHI 3<^H <£-£r DNA ^^1 ^<g^- NCBI 0RF Finder 55^^ 

(http://www.ncbi.nlm.nih.gov/gorf/gorf.html)* -M~g-SM 0RF«- ^SrS&^r. 
<85> zl ^aHI 3°lH ^ DNA^^I^ 67flsl 0RF7> 3.^-£)<H $X9X^. A ORF^- 

p-7 r 3.iffl TflSell o]S(p -carotene ketolase)* S^Srfe- crtW, p-?>3.31 -«H J= ^ 3*M S( 

J3-carotene hydroxylase)* -S^Vc- crrZ, el^^fl *r 0 l#3l 0 l^(licopene cyclase)* 
crrr, 3Z)sofl ^l^eflol^(phytoenedesaturase)lr -S^Sr^- err/, -tl H °1 2. 

(phytoene synthase)* i&^Kr crtB ^ ^Hj-^l^ ^S-i^lHH aJ Efl o] s 
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(geranylgeranyl pyrophosphate synthase)* S^Sfe crtE $\ ^7]^A ^ ^ 

<86> £tb, ^ 0RFS.^-t| -ff^s)^ ^Hk-i!: ^"l 3 ^ ^^sjafN^ ^{Alcaligenes 

sp.)(Misawa et al., Biochem. Biophys. Res. Commun. , 209(3): 867-876, 1995), ^^2=.^} 

MBradyrJj/zobjm sp.) (Hannibal ef a/., /. BacterioL, 182(13): 3850-3853, 2000) ^ 
§ej-^Ells)^- MFIavobacter/um sp.XPasamotes et al., Gene, 185(1): 35-41, 1997)^]^i 

yofl i^-eHS^. ojofl, ^ 67fl 0RF -fr^*}^ «37H£ * -fr^S) 

o>Plk^ -H^-g- -H^idi 2 tfl^| 13,2-3. ZJ-zJ- 7]^M^l t V. cr^F, cr^, cr^F, erf/ 

«j cr f£&) -fi-^Hfr zj-zj- 2, ^ili&J- 4, 6, ^<^ajjr. 8, Ait^J: 10 ^ 

Ai«i^iJ: 12«H, -SM -fr^r <>Mic<i- 3, ^"t^is 5, 7, 

a^I^ 9, ^<g^jL ii $ A^^j: i3^tq-. 

*r*r cr£ ^7>7> ^7fl^ JHl^; ^ $£534. 

<88> ^ erf -fr^r^ 5. 8°11 5.^}^. £. 8^1 ^r^r , A 

crtW, crtZ, crtY, crtl ^ crt&a= SL&ty 3-&°] ^ t SX^r. ^ 

*|, crCE ^3 7>^ 7>7.m ^ * *r Si-M , ^ ^ Afcnl, AJnal, 

<89> <^aH 5> tfl^oflA^ -B-^7>^ ^ ^ 
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HL=3^(HLpremix, Bioneer)» °l-g-*M PCR3. °1 °fl H^°H^ ^^J: 

14 ^ 153. 7l*fl£|Sr ^-g^VS^. ^t!:, PCR^r 94TC<*M 5 «- ^ 

DNA» €€^^m ^ , 1^-, 72t^H 6* ^ 94 0 C^1^ 30s, 30s ^ 

*V°l#(cycle)3. *H # 25 sj i2HH^3r w>*] 1^0.3. 72°C<*\}*\ 20 

^f-oV tfl-g-A]^ ^S^St^- PCR Topo-XL-vector(invitrogen)<H] ^^tb °1 

» ^^^l ^^S^VSt 1 ^. °1 ^ tfl^-^o.^.^ XLl-Blue(Stratagene), TOPO(Invitrogen) , 
BL2KDE3) #s}^(Stratagene), DHl(Takara) ^ DH5a (Takara)S ^>-§-*>^^. ^ ^4, 
BL2KDE3), XLl-Blue, BL2KDE3) 3.^ ^* *r 

BL2KDE3) §e^7> 7># ^ a}^^-* >$AHSfl iflfe ^-g- SHl^- *r 



^ -B-^l*}!- -^-^71- 1^&-§- ^}t|<y pCR-XL-TOPO ^(InvitrogeiM #«a«H °1» 
'pCR-XL-TOPO-crtfull'^m ^^*>^(S- 9). <#7) pCR-XL-T0P0-crtful 1 

BL2KDE3) ^S^ltl 3s =l*r W# ^^VS^. 

50 m4 LB tifl*HH 37TC, tifl^Sl^. 13,000 rpm<*IM € 

^Bl^H -$-f-^-§- v\?)3L, Sl^Ha^. T^^-i: 20 m£« 3J7>«H 

^(vortexing)t!: **** ^ ^^VSW. 13,000 rpm.5-3. ^M^: 

^, ^^^r ^-o]^}7l 190-900 nmS] ^ 
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^ 400-550 nm^ ^7} -SH?^ '-S-^S-S ^7fl^*>^cf. 44, 450 nm4 470 nm 

(S. 10). 

<92> iicj. $-7) 1 m«» #l*r°3 0.45 jt/m^ €^3. <^4tb & 

, HPLC ^Kcolumn: 4.6>§0mm, uBondapak C18, Waters, Milford, MA; mobile phase: 
acetonitrile-methanol-water(49:44:7 v/v), Flow: lOml/min, Detector: 470nra)^r 
^^r&^r. °1 ^ X|ZLT>KSigmaH*l n^tt £-7>3.€4 o^b}^^ A]~g-3r> 

<93> be^, ^^S-^-Bl o>^Bl-^!:^!^ 44, HO #g/g(dry 

weight SH13£i=r. <^ -R-5L*lH- ?>S.e] ^o] j= 7 14* ^7r^*l 

^•BjjoflA^ 44s^ ^<*fM 6,223 bp^ ^^a^^ tfl#3-°1H p-^VS.^^- 

i «fl^ : tflAl ; i;(7l^3i:KCCM-10460)7> 25 jag/g^ ^#*Kr «H €^ 

Jl41 

<94> #7HH #3^^ tf>4 ^-o] f £ ^-xgbflA^ d^E^^L-fr ^X^fe ^^\X\ 
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IHtt l] 

^3^6)] ^Att s 1 ^*}^ ^t^S: 2, ^l^s 4, ^t^J: 6, 

8, *\<i#3L 10 ^ A^^S 12S. 7)7%$$= 7l *\ <£3- ^€ ^S.^ ^^5)^ 

t 

[33 ^ 2] 

»1 llH P-?>S€ ^l^CP -carotene ketolase)* S^^-^ crr^ -fr^r 

[3^ 31 

^1 l^Hl ^°l A i, p-^r-S-tH *>o]je.-^^31°1^(P -carotene hydroxylase)* S^^Vb 
[^% 4] 

all I'M Bl-S^ A>o]^-sllo-|2:(iicopene cyclase)* S^Vfe- crtY SrQAQ ^ 

7]a^o] >H < l^i^ 6.2.3 7l*fl£}^- -fr^Tl-. 
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*fl l"^-°fl ^S.<a x^l^H^phytoene desaturase)* 3^]-^ crtl -rrQAQ 

*l] 11H1 ^M*!, ^S^l ^iBJl^l^Cphytoene synthase)* S^*}^ crtB -fr^^M 
\n=t^ 71 

*ll l^H SH-H, ^le}^^^^ sEjS-i^^l^lH -ti^H^Cgeranylgeranyl pyrophosphate 
synthase)* iS^Hfr cr^ -ft- #7.}^ ^l^o} ^ ^ *\<£#3l 125L ^fls)^ 

i^^ty 81 
91 
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101 . 

pCR-XL-TOPO-crtfull ^Si^r ^B). 
11] 

* 

*\) 10^ 7fl2:^- Ij^^t!: tftt ^M^l^WI. 

12] 

1) all 8%^ cr£ -fh^V» #3.^*Kr #31; 

2) #31 1^1 erf -^-^m *fl3^- *i|2:T5Hr ^1; 

4) #31 3^1 ^^J£^€ "3^r» ^^"M ^. tifl^ofl^^w.^ ^>S.El ice] S. # 5l^>^ 

#3|<?1 ?}SElH£t U^. 

MJt 1 * 13] 

3l 121N &<>H, 7flS^- *fl 10^ T^S^t ^B]o] ^ S.^O.S. -S>^ U" 
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14] 

I^D- 15] 

X] 12^ Sl^H, XI 11%^ cfl^-^ tifl^c^ -l Bfl^^^S.^ 7>3.l3i^l=« 3 
Wt 1 * 16] 
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2] 



1 



100 

P. haeuDflaesis (1) HSABAIJPHADLIATSLiySGGIIAASIALHVBALWI^ 

Alcaligenes_sp (1) HSGBKPGTTGDT IVNLGLT A AI UiCTLVLB AFTL¥LU)AAABPLLAVLCLftGLI-5IS VGLFII ftflliAHRCS VVPGKPBAHAAIGGLALB 

BraayrbizobiuSLSP (1) KB AAT AK AT EFG AS BBDD ABQRB7GLTL AA VI I A AflLVLBVGLHFF BPLT LBSLLP ALPLWLQTSLY YGLFI1 ABPCSUQS LYPFKPQVHBBI GQLCLF 
Consensus (1) HSA K A T V L LSAAIIAA9LVLHV LSFLDAAABPLLAIL LLGLT 9LS VGLF II ABDAHBGS WPGBPBAHA AI6QL L« 



101 



200 



P. naeundaesis (90) LYAQFSBBKHIVKHMABHBHTQTDDDPDFDBQ — QPVB«7ABFIGTTFQ?BEQELIiPYIVIVYALILQD-BBHYVVF?PI^SIl.ASIQI.FVrGTflU , HB 

Alcaligenes.sp (90) L? AGFS tfPKLI AK1HI HBRB AOTDHDPDFQHG — 6PVB«TOSFWTYFQ^EGUJ*VIVTTYALlLQD-BraYVIF^WAVLA5I0I^F8TeLPHB 

BradyrbizobiuiTsp (101) L? AQP3FD ALBVEflHKBHKBPQT AEDTOFDE?PPB3F¥HllFASFFLHYrefeKQV All A AVSLVYQLVF AYPLQBILLF9 ALPGLIS ALQLFTFUT Y LP HK 

Consensus (101) MAGES* KEITOH HBBB GIBDDPDFDBG GPVBBYASPI T YFGIIBEGLLI2TC Vl?Y ALILGD B UBYVIFlfP LP AI LASI QLFTI FGT9LPBB 

201 239 

P. haeundaesis (188) PGHDAFPBBBHABSSBISDPVSLLtCPHFQGYHHEHBLHPTVPB fiBLPST BTMDT A — 

Al call genes. sp (186) PGHJDPPpBHMBSTGIGDPIfl^ 

BradyfbtzobiuiLSP (201) P ATQPF AIiBBHABTSBFP ABIS LLTCFBF3-FHREHBLBPDAPB 5BLPHI KBBALEBBD 

Consensus (201) PGHD PPDBBBABSS I DPLSLLTCFBFGGYHBEBHLHP VPTOLP TBXKG A 



31 

1 100 

P. haeundaesis (1) tfHFUYVWWmiAWW»imHOTfl«ffl8HHBB8ra 

Alcaligenes_sp (1) waramimHttTlWWKW^ 
Consensus (1) HTHFIJ?VATYL?HELIA1SVBOTIHBGPLG1JOTHSHB2EBI)BALMm7GLWAVIATItFTVG V P VLTO ALGBT VYGII YFI LB DGL7 BQBfl 

101 1B3 
P. haeundaesis (101) PPBTIPBKGYARBt?QAHBESBA¥EGSDnCVSFGFIYAPP?DKLKQuI^TSG7I^ABAQEBT 
Alcaligenes_sp (101) PFBnPBBGWBBL^QABlLBBA?EGltDHC?SFGHlAPPVDKIiKQDLRB3G^LBPQDBRPS 
Consensus (101) PFBYIPBKGY BBLYQ ABBLBB AYBGBDBCVSFGFIY APPVDKLKQDLK SGVLB S 



41 



1 



100 



P. haeundaesis (1) YT BD VLLA8AGL AHGL1 &LALB AABPBLBVLLLDBA AGPS DGBT BSCDPPDLSPBiLABLKPLBB AS8PUQE VBFPBH ABI;L AT GYOS LDG A AL AD A? AB 
FlavobacteriuiLSP (1) HSBI/LLIA^AO&SGALTALAVSDBBPDAfdVHM 

Consensus (1) BSEDLLIAGA6LA ALIALALB BPD BILLLD AGPSD BTBSCBD DLSP SLABL PIBBA 1 DQBV FP HABBL TGY6SIDAAAL L 

101 200 
P. haeundaesis (101) SG AEIRBflSDIAELBBQ3A7 LSOGTBTB AG A7LDGBGAQPSBBLTVGFQKFVGVEIEI l£PHGVPRPKIHDATYTQQDCfl?ftrcYLLPF2PTEII.IBDT BY 

FlavobacteriuiLSP (100) H3m&1i!HTBVATlI»DTGATLTraS^^^ 

Consensus (101) G DIBSHS IA LSD GATLS GSBIEAA VIDABGA S BLTVGFQKFVGYEIETD PBGV BPHIBDATV Q DGYBFI YLLPFSPTBILIEBT BY 

201. 300 
P. haeundaesis (201) SIKyiBLIl^DALA AASBDYHSQQffSTfl ABVBBEBGILW AL ABUAA3FS ADHAEGP?V1?GLRR6FFHPY7Q xS LP YA AQVADVVAGLS6PPGT DALBGAIfe 
FlavobacteriuiLSP (199) SuGaDLDDGALAQASLDYAABBGBTflQBHEBBftflTLPl ALABDAIGFSBuBAWAVPVGLGAGLFBtYTGYSLPYA AQYADAIAAB-- DLTT AS ABBAVE 
Consensus (201) SUGG LED ALA AS DYA GWG EHBBEB8ILPI ALABDA GF* DBA G VPYGL AG FBPVTGYSLPYAAQVAD IAA TAB AIB 

301 387 

P. haeundaesis (SOI) DYAlDBABBDKFLRLUiRtttPBGCAP^ 

FlavobacteriuiLSP (297) OTAII«MI&BPimHBiILK^ 

Consensus (SOI) BAIDBA BDBFLBLLNBMLFBGC PDBBY LLQBFYBLP LIEBFYAGBLSLAD LBIVTGKPPIPL AIBCLPBBPLL E A 
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51 

1 100 
P. baeundaesis (1) HHABSPAAETXIYIGSCn-GGLALAIBLflSAGIATILVEAR^^^ 

FlavobacteriUB_sn (1) HSS AI YI GAl^(S tALAI BUKAGI AT TI Y2ABBKP6tiB AYVflHDOGHYPPAGPT VVTD?BSLBEL9ALSGCff}2UU*Vt LLPYSPFYBLTS AD 

Consensus (1) S IVIGAGFGGLALAIBMSASI ftTTI VEABDKPGQRAYYE DQGHVFDAGPTVITDPDALKELflAIiSGQ M EDVTLLPYSPFYRL B 

101 200 
P. baeunflaesls (101) GKYPDYYHEAWtSBQIAQfltt'imGraFBpYABEYYQEriYm^ 
Flavobacberlun_sp (94) (ffiSFHYKDDDBUBQYASFTOADVDOYBRFBDYAE^ 

Consensus (101) GK FDYYHD 0 L BQIA FHP DLDGYBBP DYAEEVY E8YLKLGT PFLKLGQHL AAPALMKL AYKSVBA VA FI DPBLBQ AFSFBT LL VG GH 

201 300 
p. baeundaesis (201) PFnSOTAUBALEKJmmR<BTlU3L?AGBV^^ 
Flavobacteriua_sp (134) PFnSCIYAIJHALffiBSGYff AECBTHQLYAC&YALPffi 

Consensus (201) PFSTSSnALIHALEBRGGVWAHGBTHQL?AQ8VAffBBIfl6 LLLBAKV RIDTEQ B TQVIL DOB LBAD VABSGDYHB YBDLLGBT BQ SK 

301 400 
P. haeunaaesls (301) AKSLDBKKKHSUrVLJHF(&BEAP©IABB^ 
Flavobacteriun_sp (294) AAILHBQR©8SIJVLHFCLSKBPmABHSnPaPBYK0XVHBTPIKIB^ 

Consensus (301) ALB RSSBSLFVLHFGL P IABBSIIFOPBYK LVNEIF OPEL DDFSLYLBSPC TDP LAP QHST BYYL AP VPBLGB ABIDE BAP Y 

401 SOO 
P. baeunflaesls (401) ADBlLASLFEBLIPIOAHLnTBIFTP?fiFASELllAHB^AFS'rePILTC£ AttFBPBKBDRIl^FYLVQAQlTBPQAQIPQVVOS ARATAQVfiLSDLAG 
Flavobacteriu*_sp (394) AERI FEELEBBAIPDLRKBLT VSBIFSP AD FS TE L3 ABBQS&FS VEP1LIQP A3FBPEKRDB AI PHFYTYGAGI BPGAGI PG?VG3 A£AT AQ VE I£ IiL AY 
Consensus (401) ADRI LE B IP LB LT SBIFSPADFASEL AH HQS AFS YEPILT OS AQFBPBHBDS I HFYIVGAGT HPG AGI PGYVGS AK AT AQ YBLS DL A 

501 

P. baeundaesis (501) A 

61 



I 100 

P. baeundaesis (1) HSOLYLTST EAIT OjKQSF AI AAELHPPGIRDPT YHLYAflCBB ABDYI W ALGSBPEAYN&PQARLDGUIVDTL AALQGDGPYI PPFA ALRAY ABB HDP 
FlavobacteriuiLSP (1) BTBLIATEEftAI AQdSt^QAAMBPPGlBFJnmVA^HAbDnUSQY 

Consensus (1) HSDL TS AI QGSQSFA A AXLHPPGI BDDT YHLY AE CRBADD VI DQQ LOS PEA DPQARL ALB DTLAAL DGPHSPPFAALB VABRBDF 

101 200 

P. baeuudaesis (101) m?ryiOBGFAKDVEAFflYETLDDYLI3CY^ 

Flavobacteriuii_SP (101) PDL¥PBfcIJEflFASDVADBEYBSLDD¥LE?S¥HYAti 

Consensus (101) P 9PHDLIEGFAHDV RD YRSLDD VLEYSYB? AGIVGVBH ABYHGV DD VLDBACDLQLAFQLTHIABDVIDDA IGBCYLPADBL AQA IDGPV 

201 300 
P. baeundaesis (201) ISPELYTYILRLLDEAEPYYASARYXODLPPECAlinAAALIBYRAIG^ 
FlaTObacteriun_sp (201) £SnALYST?n&U)AAEPYYWA!^LPELPPBCA«IAAALMYRtt^ 

Consensus (201) PS LYSVIIBLLD AEPYYASAB GL LPPBC WSI AAALBI YBAIG BIB GP AYBQBI ST SR AAEIGLLA GG D A SBL GA ISB GLQTB 

301 

* P. baeundaesis (301) PBHY 
Flavobacterlun_sp (301) PRA- 
Consensus (301) P 

P. haeundaesls (1) HRRDVHPIBATLLQIRLEEIAO^FGAVCQPLGAABSHGAIXSGRBFBG^ 
Flavobacteriun_sp (1) IffPKQQFPUiDLVEIBLAQISGQFGYYSAPLGAAISDAALCPGK^ 

Consensus (1) H H LL BL IA FG VS PL8AABS AALS GXBFBAHLHLL ABAS6GYCD IVDAACAVEHVBAASLIPDDLPQ1DDA BBG PAT 

101 200 
P. baeundaesis (101) B?MlGF£RAVIflGIALiTEA^LLAGAEGAS97V1$AQLV^ 
FlavobacteriuiLSP (101) BYAHGEGBAYLtoUALITEAHBILGEARGATPWBARLVASHSRAHG^^ 

Consensus (101) BVABGE BAVLAG2ALITEAH ILA ABGAS BA LV LSBALGP 6LCAGQDLDLBA K AACHE EQDLKTGVLPIAGLEBLAIIK D B Q 

201 295 
P. baeundaesis (201) fflDF(E0IMVFO3YDPLLPVV®QAALGKDTGB&AAAW^^ 
Fla vobacteriun.sp (201) LHAFGBOLGSVFrjS YP DU.D VI GDKAST GK DT ABLT AAPGHG3LH AVGQUGDVAQHYRASBftQLPELBKTBLPRGGQIAOLLABYLPBDIlJBS A 
Consensus (201) U FGBQLGBVFQSYDDLLDV1GD AA GKDTABD AAPGPK GLLW L VA BY ASBAOLD LLRSK A 1A LL BVLPB B 
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<110> KIM, Young Tae 



LEE, Jae Hyung 



biosynthesis of carotenoids <130> 



6223 <212> DNA <213> crt gene <400> 



1 



ALGENETECH <120> Genes involved in the 
15 <170> Kopatentln 1.71 <210> 1 <211> 



cgatgatggg 



3p-01-31 <160> 

1 gttccacgac tggggcatcc ccacgaccgc gtcgctgcgc gccatcgcgc 

120 



60 gccggaccgg gttctggtcg ggtcgggcgg ggtgcgtcac gggctggacg ccgcgcgggc 

180 cagcgccgag gccctgtccg 



catccgcctc ggcgcggacc tcgtggggca ggcggcccgc gcgctgcccg ccgcgcgcca 

240 gttctgcacc ggatcgggcg accttgcagc gctgcgctgc 



atcacctgtc cgacgtcgtg acccagctgc gcatcgcgat 

300 gccgggtggc caatggtcgc aagcaacggg gatggaaacc ggcgatgcgg gactgtagtc 

420 gaccgccacc agcctgatcg 



gcgcctctgc tggtgccggg 

360 tgcgcggatc gccggtccgg gggacaagat gagcgcacat gccctgccca aggcagatct 

480 gcatgcgctg tggtttctgg acgcggcggc gcatcccatc 



tctcgggcgg catcatcgcc gcgtggctgg ccctgcatgt 

540 ggggctgacc tggctgtcgg tcggtctgtt cttcatcgcg catgacgcga tgcacgggtc 

660 tgccggattt tcgtggcgca 



ctggcgatcg cgaatttcct 



600 ggtcgtgccg gggcgtccgc gcggcaatgc ggcgatgggc cagctggtcc tgtggctgta 
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agatgatcgt caagcacatg gcccatcacc gccataccgg 720 aaccgacgac gaccccgatt tcgaccatgg cggcccggtc 

cgctggtacg cgcgcttcat 780 cggcacctat ttcggctggc gcgaggggct gctgctgccc gtcatcgtga cggtctatgc 

840 gctgatcctg ggggatcgct ggatgtacgt ggtcttctgg ccgctgccgt cgatcctggc 900 gtcgatccag ctgttcgtgt 

tcggcacctg gctgccgcac cgccccggcc acgacgcgtt 960 cccggaccgc cataatgcgc ggtcgtcgcg gatcagcgac 

cccgtgtcgc tgctgacctg 1020 ctttcacttt ggtggttatc atcacgaaca ccacctgcac ccgacggtgc cttggtggcg 

1080 cctgcccagc acccgcacca agggggacac cgcatgacca atttcctgat cgtcgtcgcc 1140 accgtgctgg tgatggagtt 

gacggcctat tccgtccacc gttggatcat gcacggcccc 1200 ctgggctggg gctggcacaa gtcccaccac gaggaacacg 

accacgcgct ggaaaagaac 1260 gacctgtacg gcctggtctt tgcggtgatc gccacggtgc tgttcacggt gggctggatc 

1320 tgggcgccgg tcctgtggtg gatcgctttg ggcatgaccg tctatgggct gatctatttc 1380 gtcctgcatg acgggctggt 

tcatcagcgc tggccgttcc gctatatccc gcgcaagggc 1440 tatgcccgcc gcctgtatca ggcccaccgc ctgcaccacg 

cggtcgaggg acgcgaccat 1500 tgcgtcagct tcggcttcat ctatgcgccg ccggtcgaca agctgaagca ggacctgaag 

1560 acgtcgggcg tgctgcgggc cgaggcgcag gagcgcacgt gacccatgac gtgctgctgg 1620 caggggcggg ccttgcgaac 

gggctgatcg ccctggcgct gcgcgcggcg cggcccgacc 1680 tgcgggtgct gctgctggat catgcggcgg gaccgtcaga 

cggccatacc tggtcctgcc 1740 acgaccccga tctgtcgccg cactggctgg cgcggctgaa gcccctgcgc cgcgccaact 

1800 ggcccgacca ggaggtgcgc tttccccgcc atgcccggcg gctggccacc ggttacgggt 1860 cgctggacgg ggcggcgctg 

gcggatgcgg tggcccggtc gggcgccgag atccgctgga 1920 acagcgacat cgccctgctg gatgaacagg gggcgacgct 

gtcctgcggc acccggatcg . 1980 aggcgggcgc ggtcctggac gggcgcggcg cgcagccgtc gcggcatctg accgtgggtt 
2040 tccagaaatt cgtgggcgtc gagatcgaga ccgactgccc ccacggcgtg ccccgcccga 2100 tgatcatgga cgcgaccgtc 

acccagcagg acgggtaccg attcatctat ctgctgccct 2160 tctctccgac gcgcatcctg atcgaggaca ctcgctattc 

cgatggcggc aatctggacg 2220 acgacgcgct ggcggcggcg tcccacgact atgcccgcca gcagggctgg accggggccg 

2280 aggtccggcg cgaacgcggc atcctgccca ttgcgctggc ccatgacgcg gcgggcttct 2340 gggccgatca cgcggagggg 

cctgttcccg tgggactgcg cgcggggttc tttcacccgg 2400 tcaccggcta ttcgctgccc tatgcggcgc aggtggcgga 

cgtggtggcg ggcctgtccg 2460 ggccgcccgg caccgacgcg ctgcgcggcg ccatccgcga ttacgcgatc gaccgggcac 

2520 gccgtgaccg ctttctgcgc ctgctgaacc ggatgctgtt ccgcggctgc gcgcccgacc 2580 ggcgctatac cctgctgcag 

cggttctacc gcatgccgca tggactgatc gaacggttct 2640 atgccggccg gctgagcgtg gcggatcagc tgcgcatcgt 
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gaccggcaag cctcccattc 2700 cccttggcac ggccatccgc tgcctgcccg aacgtcccct gctgaaggaa aacgcatgaa 

2760 cgcccattcg cccgcggcca agaccgccat cgtgatcggc gcaggctttg gcgggctggc 2820 cctggccatc cgcctgcagt 

ccgcgggcat cgccaccacc ctggtcgagg cccgggacaa 2880 gcccggcggg cgcgcctatg tctggcacga tcagggccat 

gtcttcgacg cgggcccgac 2940 cgtcatcacc gaccccgatg cgctcaagga gctgtgggcg ctgaccgggc aggacatggc 

3000 gcgcgacgtg acgctgatgc cggtgtcgcc cttcfatcga ctgatgtggc cgggcgggaa 3060 ggtcttcgat tacgtgaacg 

aggccgatca gctggagcgc cagatcgccc agttcaaccc 3120 ggacgacctg gaaggatacc gccgcttccg tgattacgcg 

gaggaggtgt atcaggaggg 3180 ctacgtcaag ctgggcaccg tgcccttcct caagctgggc cagatgctca aggccgcgcc 

3240 cgcgctgatg aagctggagg cctataagtc cgtccatgcc aaggtcgcga ccttcatcaa 3300 ggacccctat ctgcggcagg 

cgttttcgta tcacacgctg ctggtgggcg ggaatccctt 3360 ctcgaccagc tcgatctatg cgctgatcca cgcgctggag 

cggcgcggcg gggtctggtt 3420 cgccaagggc ggcaccaacc agctggtcgc gggcatggtc gcgctgttcg aacggcttgg 

3480 cggccagatg atgctgaacg ccaaggtcgc ccggatcgag accgagggcg cgcggaccac 3540 gggcgtcacc ctggcggacg 

ggcggtcttt aagggccgac atggtcgcca gcaacggcga 3600 cgtcatgcac aactatcgcg acctgctggg ccacacggcc 

cgcgggcaga gccgcgcgaa 3660 atcgctggac cgcaagcgct ggtccatgtc gttgttcgtg ctgcatttcg gtctgcgcga 

3720 ggcgcccaag gacatcgcgc atcacaccat cctgttcggc ccccgctaca gggagctggt 3780 caacgagatc ttcaagggcc 

cgaagctggc cgaggatttc tcgctgtacc tgcattcgcc 3840 ctgcacgacc gatccggaca tggcgcctcc gggcatgtcc 

acgcattacg tgctggcccc 3900 cgtgccgcat ctgggccgcg ccgagatcga ttgggcggtc gaggggccgc gctatgccga 

3960 ccgcatcctg gcgtccctgg aggagcggct gatcccgaac ctgcgcgcca acctgaccac 4020 gacgcgcatc ttcacgcccg 

ccgatttcgc cagcgaactg aacgcccatc acggcagcgc 4080 cttctcggtc gagccgatcc tgacgcaatc cgcgtggttc 

cggccgcaca accgcgacaa 4140 gacgatccgc aacttctatc tggtcggcgc gggcacccat ccgggcgcgg gcattccggg 

4200 cgtcgtgggc tcggccaagg ccacggccca ggtgatgctg tccgacctgg cgggcgcatg 4260 agcgatctgg tcctgacctc 

gaccgaggcg atcacccaag ggtcgcaaag ctttgccacg 4320 gcggccaagc tgatgccgcc gggcatccgc gacgacacgg 

tgatgctcta tgcctggtgc 4380 cgccacgcgg atgacgtgat cgacggtcag gccctgggca gccgccccga ggcggtgaac 

4440 gacccgcagg cgcggctgga cggcctgcgc gtcgacacgc tggcggccct gcagggcgac 4500 ggtccggtga ccccgccctt 

tgccgcgctg cgcgcggtgg cgcggcggca tgatttcccg 4560 caggcctggc ccatggacct gatcgaaggc ttcgcgatgg 

atgtcgaggc gcgcgactat 4620 cgcacgctgg atgacgtgct ggaatattcc tatcacgtcg caggcatcgt cggcgtgatg 

4 
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4680 atggcccgcg tgatgggcgt gcgcgacgat cctgtcctgg accgcgcctg cgacctgggg 4740 ctggcgttcc agctgaccaa 

catcgcgcgc gacgtgatcg acgatgcgcg catcgggcgg 4800 tgctatctgc cgggggactg gctggaccag gcgggcgcgc 

ggatcgacgg gccggtgccg 4860 tcgccggagc tgtacacagt gatcctccgg ctgttggatg aggcggaacc ctattacgcg 

4920 tcggcgcggg tgggtctggc ggatctgcca ccgcgctgcg cctggtccat cgccgccgcg 4980 ctacggatct atcgcgccat 

cgggctgcgc atccgcaaga gcgggccgca ggcctatcgc 5040 cagcggatca gcacgtccaa ggctgccaag atcggcctgc 

tgggcgtcgg gggctgggat 5100 gtcgcgcgat cacgcctgcc gggggcgggc gtgtcgcggc agggcctctg gacccggccg 

5160 catcacgtct aggcgcgcgc ggcgtagggc agaacccgtt ccagcagggc cgcgatttcc 5220 ggagcctgaa ggcgcttgct 

gcgcagcatc gcgtccagtt gggcgcggct ggcctcgtaa 5280 tgacgggaca cgttctgcag gtctgacacg gccagaaggc 

cgcgccgcgg gccgggggcc 5340 gcggcatcgc gaccggtatc cttgccaagc gccgcctggt cgcccacgac gtccagcagg 

5400 tcgtcatagg actggaacac gcggcccagc tgacggccaa agtcgatcat ctgggtctgc 5460 tcctcggcgt cgaactcctt 

gatcacggcc agcatctcca gcccggcgat gaacagcacg 5520 ccggtcttca ggtcctgttc ctgttcgacc cccgcgccgt 

tcttggccgc gtgcaggtcc 5580 aggtcctggc cggcgcacag gccctgcggc cccagggacc gcgacaggat ccgcaccagc 

5640 tgcgcccgca ccgtgcccga cgcgccgcgc gcaccggcca gcagggccat tgcctcggtg 5700 atcagggcga tgccgcccag 

cacggcacgg ctttcgccat gcgccacatg ggtcgcgggc 5760 cggccgcggc gcagcccggc atcgtccatg cagggcaggt 

* 

cgtcgaagat cagcgatgcg 5820 gcatgcacca tctcgaccgc gcaggcggcg tcgacgatcg tgtcgcagac cccgcccgag 

5880 gcctctgccg caagcagcat cagcatgccg cggaaccgcc tgcccgacga cagcgcgcca 5940 tggctcatgg ccgcgccgag 

cggctgcgac acggcaccga atccctgggc gatctcctca 6000 agtctggtct gcagaagggt ggcgtggatc gggttgacgt 

ctcgtctcat cagtgccttc 6060 gcgcttgggt tctgacctgg cgggaaggtc aggccggggc ggcaccccgt gacccgtcat 

6120 ccaccgtcaa cagtccccat gttggaacgg ttcacgcccg attgcgagcc ttttcgacgg 6180 cgacgcgggg tcgcgcggca 

atttgtccaa caaggtcagt gga 6223 <210> 2 <211> 729 <212> DNA <213> crtW 

gene <400> 2 atgagcgcac atgccctgcc caaggcagat ctgaccgcca ccagcctgat cgtctcgggc 60 ggcatcatcg 

ccgcgtggct ggccctgcat gtgcatgcgc tgtggtttct ggacgcggcg 120 gcgcatccca tcctggcgat cgcgaatttc 

ctggggctga cctggctgtc ggtcggtctg 180 ttcttcatcg cgcatgacgc gatgcacggg tcggtcgtgc cggggcgtcc 

gcgcggcaat 240 gcggcgatgg gccagctggt cctgtggctg tatgccggat tttcgtggcg caagatgatc 300 

gtcaagcaca tggcccatca ccgccatacc ggaaccgacg acgaccccga tttcgaccat 360 ggcggcccgg tccgctggta 
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cgcgcgcttc atcggcacct atttcggctg gcgcgagggg 420 ctgctgctgc ccgtcatcgt gacggtctat gcgctgatcc 

tgggggatcg ctggatgtac 480 gtggtcttct ggccgctgcc gtcgatcctg gcgtcgatcc agctgttcgt gttcggcacc 

540 tggctgccgc accgccccgg ccacgacgcg ttcccggacc gccataatgc gcggtcgtcg 600 cggatcagcg accccgtgtc 

gctgctgacc tgctttcact ttggtggtta tcatcacgaa 660 caccacctgc acccgacggt gccttggtgg cgcctgccca 

gcacccgcac caagggggac 720 accgcatga 

729 <210> 3 <211> 242 <212> PRT <213> crtW amino acid <400> 3 Met Ser Ala His Ala Leu Pro 
Lys Ala Asp Leu Thr Ala Thr Ser Leu 1 5 10 15 He Val 

Ser Gly Gly He He Ala Ala Trp Leu Ala Leu His Val His 20 25 

* 

30 Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro He Leu Ala He Ala 35 40 

45 Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe Phe He Ala 50 55 

60 His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Gly Asn 65 70 
75 80 Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp 8 

90 95 Arg Lys Met He Val Lys His Met Ala His His Arg His Thr Gly Thr 100 

105 110 Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp Tyr Ala 115 

120 125 Arg Phe He Gly Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro 130 

135 140 Val lie Val Thr Val Tyr Ala Leu He Leu Gly Asp Arg Trp Met Tyr 145 

150 155 160 Val Val Phe Trp Pro Leu Pro Ser He Leu Ala Ser He Gin Leu 

Phe 165 170 175 Val Phe Gly Thr Trp Leu Pro His Arg Pro 

Gly His Asp Ala Phe Pro 180 185 190 Asp Arg His Asn Ala Arg 

Ser Ser Arg He Ser Asp Pro Val Ser Leu 195 200 205 Leu Thr Cys 

Phe His Phe Gly Gly Tyr His His Glu His His Leu His 210 215 220 Pro 

Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 225 230 235 

240 Thr Ala <210> 4 <2U> 489 <212> DNA <213> crtZ gene <400> 4 atgaccaatt tcctgatcgt 
cgtcgccacc gtgctggtga tggagttgac ggcctattcc 60 gtccaccgtt ggatcatgca cggccccctg ggctggggct 

ggcacaagtc ccaccacgag 120 gaacacgacc acgcgctgga aaagaacgac ctgtacggcc tggtctttgc ggtgatcgcc 
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180 acggtgctgt'tcacggtggg ctggatctgg gcgccggtcc tgtggtggat cgctttgggc 240 atgaccgtct atgggctgat 

> 

ctatttcgtc ctgcatgacg ggctggttca tcagcgctgg 300 ccgttccgct atatcccgcg caagggctat gcccgccgcc 

tgtatcaggc ccaccgcctg 360 caccacgcgg tcgagggacg cgaccattgc gtcagcttcg gcttcatcta tgcgccgccg 

420 gtcgacaagc tgaagcagga cctgaagacg tcgggcgtgc tgcgggccga ggcgcaggag 480 cgcacgtga 

489 <210> 5 <211> 162 <212> PRT <213> crtZ amino acid <400> 5 Met Thr Asn Phe Leu He Val 

Val Ala Thr Val Leu Val Met Glu Leu 15 10 15 Thr Ala 

Tyr Ser Val His Arg Trp He Met His Gly Pro Leu Gly Trp 20 25 

30 Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 35 40 

45 Asn Asp Leu Tyr Gly Leu Val Phe Ala Val He Ala Thr Val Leu Phe 50 55 

60 Thr Val Gly Trp He Trp Ala Pro Val Leu Trp Trp He Ala Leu Gly 65 70 

75 80 Met Thr Val Tyr Gly Leu He Tyr Phe Val Leu His Asp Gly Leu Val 8 

90 95 His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Lys Gly Tyr Ala Arg 100 

105 110 Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 115 

120 125 His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 130 

135 140 Lys Gin Asp Leu Lys Thr Ser Gly Val Leu Arg Ala Glu Ala Gin Glu 145 

150 155 160 Arg Thr <210> 6 <211> 1161 <212> DNA <213> crtY 

gene <400> 6 gtgacccatg acgtgctgct ggcaggggcg ggccttgcga acgggctgat cgccctggcg 60 ctgcgcgcgg 

cgcggcccga cctgcgggtg ctgctgctgg atcatgcggc gggaccgtca 120 gacggccata cctggtcctg ccacgacccc 

gatctgtcgc cgcactggct ggcgcggctg 180 aagcccctgc gccgcgccaa ctggcccgac caggaggtgc gctttccccg 

ccatgcccgg 240 cggctggcca ccggttacgg gtcgctggac ggggcggcgc tggcggatgc ggtggcccgg 300 

tcgggcgccg agatccgctg gaacagcgac atcgccctgc tggatgaaca gggggcgacg 360 ctgtcctgcg gcacccggat 

cgaggcgggc gcggtcctgg acgggcgcgg cgcgcagccg 420 tcgcggcatc tgaccgtggg tttccagaaa ttcgtgggcg 

tcgagatcga gaccgactgc 480 ccccacggcg tgccccgccc gatgatcatg gacgcgaccg tcacccagca ggacgggtac 

540 cgattcatct atctgctgcc cttctctccg acgcgcatcc tgatcgagga cactcgctat 600 tccgatggcg gcaatctgga 

cgacgacgcg ctggcggcgg cgtcccacga ctatgcccgc 660 cagcagggct ggaccggggc cgaggtccgg cgcgaacgcg 
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gcatcctgcc cattgcgctg 720 gcccatgacg cggcgggctt ctgggccgat cacgcggagg ggcctgttcc cgtgggactg 

780 cgcgcggggt tctttcaccc ggtcaccggc tattcgctgc cctatgcggc gcaggtggcg 840 gacgtggtgg cgggcctgtc 

cgggccgccc ggcaccgacg cgctgcgcgg cgccatccgc 900 gattacgcga tcgaccgggc acgccgtgac cgctttctgc 

gcctgctgaa ccggatgctg 960 ttccgcggct. gcgcgcccga ccggcgctat accctgctgc agcggttcta ccgcatgccg 

1020 catggactga tcgaacggtt ctatgccggc cggctgagcg tggcggatca gctgcgcatc 1080 gtgaccggca agcctcccat 

tccccttggc acggccatcc gctgcctgcc cgaacgtccc 1140 ctgctgaagg aaaacgcatg a 

1161 <210> 7 <211> 386 <212> PRT <213> crtY amino acid <400> 7 Val Thr His Asp Val Leu Leu 

Ala Gly Ala Gly Leu Ala Asn Gly Leu 1 5 10 15 He Ala - 

Leu Ala Leu Arg Ala Ala Arg Pro Asp Leu Arg Val Leu Leu 20 25 

30 Leu Asp His Ala Ala Gly Pro Ser Asp Gly His Thr Trp Ser Cys His 35 40 

45 Asp Pro Asp Leu Ser Pro His Trp Leu Ala Arg Leu Lys Pro Leu Arg 50 55 

• * 

60 Arg Ala Asn Trp Pro Asp Gin Glu Val Arg Phe Pro Arg His Ala Arg 65 70 
75 80 Arg Leu Ala Thr Gly Tyr Gly Ser Leu Asp Gly Ala Ala Leu Ala Asp 8 

90 95 Ala Val Ala Arg Ser Gly Ala Glu He Arg Trp Asn Ser Asp He Ala 100 

105 110 Leu Leu Asp Glu Gin Gly Ala Thr Leu Ser Cys Gly Thr Arg lie Glu 115 

120 125 Ala Gly Ala Val Leu Asp Gly Arg Gly Ala Gin Pro Ser Arg His Leu 130 

135 140 Thr Val Gly Phe Gin Lys Phe Val Gly Val Glu He Glu Thr Asp Cys 145 

150 155 160 Pro His Gly Val Pro Arg Pro Met He Met Asp Ala Thr Val Thr 

Gln 165 170 175 Gin Asp Gly Tyr Arg Phe He Tyr Leu Leu 

Pro Phe Ser Pro Thr Arg 180 185 190 He Leu He Glu Asp Thr 

Arg Tyr Ser Asp Gly Gly Asn Leu Asp Asp 195 200 205 Asp Ala Leu 

Ala Ala Ala Ser His Asp Tyr Ala Arg Gin Gin Gly Trp 210 215 220 Thr 

Gly Ala Glu Val Arg Arg Glu Arg Gly lie Leu Pro He Ala Leu 225 230 235 

240 Ala His Asp Ala Ala Gly Phe Trp Ala Asp His Ala Glu Gly Pro Val 245 



250 255 Pro Val Gly Leu Arg Ala Gly Phe Phe His Pro Val Thr Gly Tyr Ser 



260 
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265 270 Leu Pro Tyr Ala Ala Gin Val Ala Asp Val Val Ala Gly Leu Ser Gly 275 

280 285 Pro Pro Gly Thr Asp Ala Leu Arg Gly Ala lie Arg Asp Tyr Ala He 290 

295 300 Asp Arg Ala Arg Arg Asp Arg Phe Leu Arg Leu Leu Asn Arg Met Leu 305 

310 - 315 320 Phe Arg Gly Cys Ala Pro Asp Arg Arg Tyr Thr Leu Leu Gin Arg 

Phe 325 330 335 Tyr Arg Met Pro His Gly Leu He Glu Arg 

Phe Tyr Ala Gly Arg Leu 340 345 350 Ser Val Ala Asp Gin Leu 

Arg He Val Thr Gly Lys Pro Pro lie Pro 355 360 365 Leu Gly Thr 

Ala He Arg Cys Leu Pro Glu Arg Pro Leu Leu Lys Glu 370 375 380 Asn 

Ala 385 <210> 8 <211> 1506 <212> DNA <213> crtl gene <400> 8 atgaacgccc attcgcccgc 
ggccaagacc gccatcgtga tcggcgcagg ctttggcggg 60 ctggccctgg ccatccgcct gcagtccgcg ggcatcgcca 

ccaccctggt cgaggcccgg 120 gacaagcccg gcgggcgcgc ctatgtctgg cacgatcagg gccatgtctt cgacgcgggc 

180 ccgaccgtca tcaccgaccc cgatgcgctc aaggagctgt gggcgctgac cgggcaggac 240 atggcgcgcg acgtgacgct 

gatgccggtg tcgcccttct atcgactgat gtggccgggc 300 gggaaggtct tcgattacgt gaacgaggcc gatcagctgg 

agcgccagat cgcccagttc 360 aacccggacg acctggaagg ataccgccgc ttccgtgatt acgcggagga ggtgtatcag 

420 gagggctacg tcaagctggg caccgtgccc ttcctcaagc tgggccagat gctcaaggcc 480 gcgcccgcgc tgatgaagct 

ggaggcctat aagtccgtcc atgccaaggt cgcgaccttc 540 atcaaggacc cctatctgcg gcaggcgttt tcgtatcaca 

cgctgctggt gggcgggaat 600 cccttctcga ccagctcgat ctatgcgctg atccacgcgc tggagcggcg cggcggggtc 

660 tggttcgcca agggcggcac caaccagctg gtcgcgggca tggtcgcgct gttcgaacgg 720 cttggcggcc agatgatgct 

gaacgccaag gtcgcccgga tcgagaccga gggcgcgcgg 780 accacgggcg tcaccctggc ggacgggcgg tctttaaggg 

ccgacatggt cgccagcaac 840 ggcgacgtca tgcacaacta tcgcgacctg ctgggccaca cggcccgcgg gcagagccgc 

900 gcgaaatcgc tggaccgcaa gcgctggtcc atgtcgttgt tcgtgctgca tttcggtctg 960 cgcgaggcgc ccaaggacat 

cgcgcatcac accatcctgt tcggcccccg ctacagggag 1020 ctggtcaacg agatcttcaa gggcccgaag ctggccgagg 

atttctcgct gtacctgcat 1080 tcgccctgca cgaccgatcc ggacatggcg cctccgggca tgtccacgca ttacgtgctg 

1140 gcccccgtgc cgcatctggg ccgcgccgag atcgattggg cggtcgaggg gccgcgctat 1200 gccgaccgca tcctggcgtc 

cctggaggag cggctgatcc cgaacctgcg cgccaacctg 1260 accacgacgc gcatcttcac gcccgccgat ttcgccagcg 
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aactgaacgc ccatcacggc 1320 agcgccttct cggtcgagcc gatcctgacg caatccgcgt ggttccggcc gcacaaccgc 

1380 gacaagacga tccgcaactt ctatctggtc ggcgcgggca cccatccggg cgcgggcatt 1440 ccgggcgtcg tgggctcggc 

caaggccacg gcccaggtga tgctgtccga cctggcgggc . 1500 gcatga 

1506 <210> 9 <211> 501 <212> PRT <213> crtl amino acid <400> 9 Met Asn Ala His Ser Pro Ala 

Ala Lys Thr Ala He Val He Gly Ala 15 10 15 Gly Phe 

Gly Gly Leu Ala Leu Ala He Arg Leu Gin Ser Ala Gly He 20 25 

30 Ala Thr Thr Leu Val Glu Ala Arg Asp Lys Pro Gly Gly Arg Ala Tyr 35 40 

45 Val Trp His Asp Gin Gly His Val Phe Asp Ala Gly Pro Thr Val He 50 55 

60 Thr Asp Pro Asp Ala Leu Lys Glu Leu Trp Ala Leu Thr Gly Gin Asp 65 70 

75 80 Met Ala Arg Asp Val Thr Leu Met Pro Val Ser Pro Phe Tyr Arg Leu 8 

90 95 Met Trp Pro Gly Gly Lys Val Phe Asp Tyr Val Asn Glu Ala Asp Gin 100 

105 110 Leu Glu Arg Gin lie Ala Gin Phe Asn Pro Asp Asp Leu Glu Gly Tyr 115 

120 125 Arg Arg Phe Arg Asp Tyr Ala Glu Glu Val Tyr Gin Glu Gly Tyr Val 130 

135 140 Lys Leu Gly Thr Val Pro Phe Leu Lys Leu Gly Gin Met Leu Lys Ala 145 

150 155 160 Ala Pro Ala Leu Met Lys Leu Glu Ala Tyr Lys Ser Val His Ala 

Lys 165 170 175 Val Ala Thr Phe lie Lys Asp Pro Tyr Leu 

Arg Gin Ala Phe Ser Tyr 180 185 190 His Thr Leu Leu Val Gly 

Gly Asn Pro Phe Ser Thr Ser Ser He Tyr 195 200 205 Ala Leu lie 

His Ala Leu Glu Arg Arg Gly Gly Val Trp Phe Ala Lys 210 215 220 Gly 

Gly Thr Asn Gin Leu Val Ala Gly Met Val Ala Leu Phe Glu Arg 225 230 235 

240 Leu Gly Gly Gin Met Met Leu Asn Ala Lys Val Ala Arg He Glu Thr 245 

250 255 Glu Gly Ala Arg Thr Thr Gly Val Thr Leu Ala Asp Gly Arg Ser Leu 260 

265 270 Arg Ala Asp Met Val Ala Ser Asn Gly Asp Val Met His Asn Tyr Arg 275 

280 285 Asp Leu Leu Gly His Thr Ala Arg Gly Gin Ser Arg Ala Lys Ser Leu 290 

295 300 Asp Arg Lys Arg Trp Ser Met Ser Leu Phe Val Leu His Phe Gly Leu 305 
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310 315. 320 Arg Glu Ala Pro Lys Asp lie Ala His His Thr He Leu Phe Gly 

n 

m 

, p ro 325 330 335 Arg Tyr Arg Glu Leu Val Asn Glu He Phe 

Lys Gly Pro Lys Leu Ala * 340 345 350 Glu Asp Phe Ser Leu Tyr 

Leu His Ser Pro Cys Thr Thr Asp Pro Asp 355 360 365 Met Ala Pro 

Pro Gly Met Ser Thr His Tyr Val Leu Ala Pro Val Pro 370 375 380 His 

Leu Gly Arg Ala Glu lie Asp Trp Ala Val Glu Gly Pro Arg Tyr 385 390 395 

400 Ala Asp Arg He Leu Ala Ser Leu Glu Glu Arg Leu He Pro Asn Leu 405 
410 415 Arg Ala Asn Leu Thr Thr Thr Arg lie Phe Thr Pro Ala Asp Phe Ala 420 

425 430 Ser Glu Leu Asn Ala His His Gly Ser Ala Phe Ser Val Glu Pro He 435 

440 445 Leu Thr Gin Ser Ala Trp Phe Arg Pro His Asn Arg Asp Lys Thr lie 450 

455 460 Arg Asn Phe Tyr Leu Val Gly Ala Gly Thr His Pro Gly Ala Gly He 465 

470 475 480 Pro Gly Val Val Gly Ser Ala Lys Ala Thr Ala Gin Val Met Leu 

Ser 485 490 495 Asp Leu Ala Gly Ala 500 

210> 10 <211> 915 <212> DNA <213> crtB gene <400> 10 atgagcgatc tggtcctgac ctcgaccgag 
gcgatcaccc aagggtcgca aagctttgcc 60 acggcggcca agctgatgcc gccgggcatc cgcgacgaca cggtgatgct 

ctatgcctgg 120 tgccgccacg cggatgacgt gatcgacggt caggccctgg gcagccgccc cgaggcggtg 180 

aacgacccgc aggcgcggct ggacggcctg cgcgtcgaca cgctggcggc cctgcagggc 240 gacggtccgg tgaccccgcc 

ctttgccgcg ctgcgcgcgg tggcgcggcg gcatgatttc 300 ccgcaggcct ggcccatgga cctgatcgaa ggcttcgcga 

tggatgtcga ggcgcgcgac 360 tatcgcacgc tggatgacgt gctggaatat tcctatcacg tcgcaggcat cgtcggcgtg 

* 

420 atgatggccc gcgtgatggg cgtgcgcgac gatcctgtcc tggaccgcgc ctgcgacctg 480 gggctggcgt tccagctgac 

caacatcgcg cgcgacgtga tcgacgatgc gcgcatcggg 540 cggtgctatc tgccggggga ctggctggac caggcgggcg 

cgcggatcga cgggccggtg 600 ccgtcgccgg agctgtacac agtgatcctc cggctgttgg atgaggcgga accctattac 

660 gcgtcggcgc gggtgggtct ggcggatctg ccaccgcgct gcgcctggtc catcgccgcc 720 gcgctacgga tctatcgcgc 

catcgggctg cgcatccgca agagcgggcc gcaggcctat 780 cgccagcgga tcagcacgtc caaggctgcc aagatcggcc 

tgctgggcgt cgggggctgg 840 gatgtcgcgc gatcacgcct gccgggggcg ggcgtgtcgc ggcagggcct ctggacccgg 
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900 ccgcatcacg tctag 



915 <210> 11 <2H> 304 



<212> PRT <213> crtB amino acid <400> 11 Met Ser Asp Leu Val Leu Thr Ser Thr Glu Ala lie Thr Gin 



Gly Ser 1 



10 



15 Gin Ser Phe Ala Thr Ala Ala Lys Leu 



Met Pro Pro Gly lie Arg Asp 



20 



25 



30 Asp Thr Val Met Leu 



Tyr Ala Trp Cys Arg His Ala Asp Asp Val lie 



35 



40 



45 Asp Gly 



Gin Ala Leu Gly Ser Arg Pro Glu Ala Val Asn Asp Pro Gin 



50 



55 



60 



Ala Arg Leu Asp Gly Leu Arg Val Asp Thr Leu Ala Ala Leu Gin Gly 65 



70 



80 Asp Gly Pro Val Thr Pro Pro Phe Ala Ala Leu Arg Ala Val Ala Arg 



85 



95 Arg His Asp Phe Pro Gin Ala Trp Pro Met Asp Leu He Glu Gly Phe 



100 



105 



110 Ala Met Asp Val Glu Ala Arg Asp Tyr Arg Thr Leu Asp Asp Val Leu 



115 



120 



125 Glu Tyr Ser Tyr His Val Ala Gly He Val Gly Val Met Met Ala Arg 



130 



135 



140 Val Met Gly Val Arg Asp Asp Pro Val Leu Asp Arg Ala Cys Asp Leu 145 



150 



155 



160 Gly Leu Ala Phe Gin Leu Thr Asn He Ala Arg Asp Val He Asp Asp 



165 



170 



175 Ala Arg He Gly Arg Cys Tyr Leu Pro Gly Asp Trp Leu Asp Gin 



Ala 



180 



185 



190 Gly Ala Arg lie Asp Gly Pro Val Pro Ser Pro 



Glu Leu Tyr Thr Val 



195 



200 



205 He Leu Arg Leu Leu Asp Glu Ala 



Glu Pro Tyr Tyr Ala Ser Ala Arg 



210 



215 



220 Val Gly Leu Ala Asp Leu 



Pro Pro Arg Cys Ala Trp Ser He Ala Ala 225 



230 



235 



240 



Ala Leu Arg He Tyr Arg Ala He Gly Leu Arg He Arg Lys Ser Gly 



245 



250 



255 Pro Gin Ala Tyr Arg Gin Arg He Ser Thr Ser Lys Ala Ala Lys He 



260 



265 



270 Gly Leu Leu Gly Val Gly Gly Trp Asp Val Ala Arg Ser Arg Leu Pro 



275 



280 



285 Gly Ala Gly Val Ser Arg Gin Gly Leu Trp Thr Arg Pro His His Val 



290 



295 



300 <210> 12 <211> 882 <212> DNA <213> crtE gene <400> 12 atgagacgag acgtcaaccc gatccacgc 



acccttctgc agaccagact tgaggagatc 



60 gcccagggat tcggtgccgt gtcgcagccg ctcggcgcgg ccatgagcca 



tggcgcgctg 



120 tcgtcgggca ggcggttccg cggcatgctg atgctgcttg cggcagaggc ctcgggcggg 



180 
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gtctgcgaca cgatcgtcga cgccgcctgc gcggtcgaga tggtgcatgc cgcatcgctg 240 atcttcgacg acctgccctg 

catggacgat gccgggctgc gccgcggccg gcccgcgacc 300 catgtggcgc atggcgaaag ccgtgccgtg ctgggcggca 

tcgccctgat caccgaggca 360 atggccctgc tggccggtgc gcgcggcgcg tcgggcacgg tgcgggcgca gctggtgcgg 

420 atcctgtcgc ggtccctggg gccgcagggc ctgtgcgccg gccaggacct ggacctgcac 480 gcggccaaga acggcgcggg 

ggtcgaacag gaacaggacc tgaagaccgg cgtgctgttc 540 atcgccgggc tggagatgct ggccgtgatc aaggagttcg 

acgccgagga gcagacccag 600 atgatcgact ttggccgtca gctgggccgc gtgttccagt cctatgacga cctgctggac 

660 gtcgtgggcg accaggcggc gcttggcaag gataccggtc gcgatgccgc ggcccccggc 720 ccgcggcgcg gccttctggc 

cgtgtcagac ctgcagaacg tgtcccgtca ttacgaggcc 780 agccgcgccc aactggacgc gatgctgcgc agcaagcgcc 

ttcaggctcc ggaaatcgcg 840 gccctgctgg aacgggttct gccctacgcc gcgcgcgcct ag 

882 <210> 13 <211> 293 <212> PRT <213> crtE amino acid <400> 13 Met Arg Arg Asp Val Asn Pr 
He His Ala Thr Leu Leu Gin Thr Arg 1 5 10 15 Leu Glu 

Glu lie Ala Gin Gly Phe Gly Ala Val Ser Gin Pro Leu Gly 20 25 

■ 

30 Ala Ala Met Ser His Gly Ala Leu Ser Ser Gly Arg Arg Phe Arg Gly 35 40 

45 Met Leu Met Leu Leu Ala Ala Glu Ala Ser Gly Gly Val Cys Asp Thr 50 55 

« 

60 He Val Asp Ala Ala Cys Ala Val Glu Met Val His Ala Ala Ser Leu 65 70 

75 80 He Phe Asp Asp Leu Pro Cys Met Asp Asp Ala Gly Leu Arg Arg Gly 8 

90 95 Arg Pro Ala Thr His Val Ala His Gly Glu Ser Arg Ala Val Leu Gly 100 

105 110 Gly He Ala Leu He Thr Glu Ala Met Ala Leu Leu Ala Gly Ala Arg 115 

120 125 Gly Ala Ser Gly Thr Val Arg Ala Gin Leu Val Arg lie Leu Ser Arg 130 

135 140 Ser Leu Gly Pro Gin Gly Leu Cys Ala Gly Gin Asp Leu Asp Leu His 145 

150 155 160 Ala Ala Lys Asn Gly Ala Gly Val Glu Gin Glu Gin Asp Leu Lys 

T hr 165 170 175 Gly Val Leu Phe He Ala Gly Leu Glu Met 

Leu Ala Val He Lys Glu 180 185 190 Phe Asp Ala Glu Glu Gin 

Thr Gin Met He Asp Phe Gly Arg Gin Leu 195 200 205 Gly Arg Val 

Phe Gin Ser Tyr Asp Asp Leu Leu Asp Val Val Gly Asp 210 215 220 Gin 
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Ala Ala Leu G.ly Lys Asp Thr Gly Arg Asp Ala Ala Ala Pro Gly 225 230 235 

240 Pro Arg Arg Gly Leu Leu Ala Val Ser Asp Leu Gin Asn Val Ser Arg 245 

250 255 His Tyr Glu Ala Ser Arg Ala Gin Leu Asp Ala Met Leu Arg Ser Lys 260 

265 270 Arg Leu Gin Ala Pro Glu He Ala Ala Leu Leu Glu Arg Val Leu Pro 275 

280 285 Tyr Ala Ala Arg Ala 290 <210> 14 <211> 19 <212> DNA <213 

Artificial Sequence <220> <223> forward primer for crt gene <400> 14 gttccacgac tggggcatc 
19 <210> 15 <211> 28 <212> DNA <213> Artificial Sequence <220> <223> reverse primer for cr 
gene <400> 15 tccactgacc ttgttggaca aattgccg 28 
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